Text Categorization through Multistrategy Learning and Visualization
نویسندگان
چکیده
This paper introduces a multistrategy learning approach to the categorization of text documents. The approach benefits from two existing, and in our view complimentary, sets of categorization techniques: those based on Rocchios algorithm and those belonging to the rule learning class of machine learning algorithms. Visualization is used for the presentation of the output of learning.
منابع مشابه
From Data Mining to Knowledge Mining
In view of the tremendous production of computer data worldwide, there is a strong need for new powerful tools that can automatically generate useful knowledge from a variety of data, and present it in human-oriented forms. In efforts to satisfy this need, researchers have been exploring ideas and methods developed in machine learning, statistical data analysis, data mining, text mining, data v...
متن کاملInitial Considerations toward Knowledge Mining
In view of the tremendous production of computer data worldwide, there is a strong need for new powerful tools that can automatically generate useful knowledge from a variety of data, and present it in human-oriented forms. In efforts to satisfy this need, researchers have been exploring ideas and methods developed in machine learning, statistical data analysis, data mining, text mining, data v...
متن کاملImproving the Operation of Text Categorization Systems with Selecting Proper Features Based on PSO-LA
With the explosive growth in amount of information, it is highly required to utilize tools and methods in order to search, filter and manage resources. One of the major problems in text classification relates to the high dimensional feature spaces. Therefore, the main goal of text classification is to reduce the dimensionality of features space. There are many feature selection methods. However...
متن کاملLink Analysis in Www
DEFINITION The information age has made it easy to store large amounts of data. The proliferation of documents available on the Web is rapidly growing. Search engines only worsen the problem by making more and more documents available in just a few key strokes. Link Analysis is a new, exciting and rapidly growing area of research that tries to solve the information overload problem by using tec...
متن کاملData Mining and Knowledge Discovery: A Review of Issues and a Multistrategy Approach
An enormous proliferation of databases in almost every area of human endeavor has created a great demand for new, powerful tools for turning data into useful, task-oriented knowledge. In efforts to satisfy this need, researchers have been exploring ideas and methods developed in machine learning, pattern recognition, statistical data analysis, data visualization, neural nets, etc. These efforts...
متن کامل